1,926 research outputs found

    A semi-supervised clustering method for payload extraction

    Get PDF
    Master of ScienceDepartment of Electrical and Computer EngineeringDon M. GruenbacherWilliam H. HsuThis thesis addresses payload extraction, the information extraction task of capturing the text of an article from a formatted document such as a PDF file, and focuses on the application and improvement of density-based clustering algorithms as an alternative or supplement to rule-based methods for this task domain. While supervised learning performs well on classification-based subtasks of payload extraction such as relevance filtering of documents or sections in a collection, the labeled data which it requires for training are often prohibitively expensive (in terms of the time resources of annotators and developers) to obtain. On the other hand, unlabeled data is often relatively easily available without cost in large quantities, but there have not been many ways to exploit them. Semi-supervised learning addresses this problem by using large amounts of unlabeled data, together with the labeled data, to build better classifiers. In this thesis, I present a semi-supervised learning-driven approach for the analysis of scientific literature which either already contains unlabeled metadata, or from which this metadata can be computed. Furthermore, machine learning-based analysis techniques are exploited to make this system robust and flexible to its data environment. The overall goal of this research is to develop a methodology to support the document analysis functions of layout-based document segmentation and section classification. This is implemented within an information extraction system within which the empirical evaluation and engineering objectives of this work are framed. As an example application, my implementation supports detection and classification of titles, authors, additional author information, abstract, and the titles and body of subsections such as ‘Introduction’, ‘Method’, ‘Result’, ’Discussion’, ‘Acknowledgement’, ’Reference’, etc. The novel contribution of this work also includes payload extraction as an intermediate functional stage within a pipeline for procedural information extraction from the scientific literature. My experimental results show that this approach outperforms a state-of-the-field heuristic pattern analysis system on a corpus from the domain of nanomaterials synthesis

    A COMBINATORIAL APPROACH TO ROOT MULTIPLICITIES OF RANK 2 HYPERBOLIC KAC-MOODY ALGEBRAS

    Get PDF
    In this paper we study root multiplicities of rank 2 hyperbolic Kac-Moody algebras using the combinatorics of Dyck paths

    Progressive Processing of Continuous Range Queries in Hierarchical Wireless Sensor Networks

    Full text link
    In this paper, we study the problem of processing continuous range queries in a hierarchical wireless sensor network. Contrasted with the traditional approach of building networks in a "flat" structure using sensor devices of the same capability, the hierarchical approach deploys devices of higher capability in a higher tier, i.e., a tier closer to the server. While query processing in flat sensor networks has been widely studied, the study on query processing in hierarchical sensor networks has been inadequate. In wireless sensor networks, the main costs that should be considered are the energy for sending data and the storage for storing queries. There is a trade-off between these two costs. Based on this, we first propose a progressive processing method that effectively processes a large number of continuous range queries in hierarchical sensor networks. The proposed method uses the query merging technique proposed by Xiang et al. as the basis and additionally considers the trade-off between the two costs. More specifically, it works toward reducing the storage cost at lower-tier nodes by merging more queries, and toward reducing the energy cost at higher-tier nodes by merging fewer queries (thereby reducing "false alarms"). We then present how to build a hierarchical sensor network that is optimal with respect to the weighted sum of the two costs. It allows for a cost-based systematic control of the trade-off based on the relative importance between the storage and energy in a given network environment and application. Experimental results show that the proposed method achieves a near-optimal control between the storage and energy and reduces the cost by 0.989~84.995 times compared with the cost achieved using the flat (i.e., non-hierarchical) setup as in the work by Xiang et al.Comment: 41 pages, 20 figure

    Coulomb Driven New Bound States at the Integer Quantum Hall States in GaAs/Al(0.3)Ga(0.7)As Single Heterojunctions

    Full text link
    Coulomb driven, magneto-optically induced electron and hole bound states from a series of heavily doped GaAs/Al(0.3)Ga(0.7)As single heterojunctions (SHJ) are revealed in high magnetic fields. At low magnetic fields (nu > 2), the photoluminescence spectra display Shubnikov de-Haas type oscillations associated with the empty second subband transition. In the regime of the Landau filling factor nu < 1 and 1 < nu <2, we found strong bound states due to Mott type localizations. Since a SHJ has an open valence band structure, these bound states are a unique property of the dynamic movement of the valence holes in strong magnetic fields

    Baryonic Matter in the Hidden Local Symmetry Induced from Holographic QCD Models

    Full text link
    Baryonic matter is studied in the Skyrme model by taking into account the roles of π,\pi, ρ\rho, and ω\omega mesons through the hidden local symmetry up to O(p4)\mathcal{O}(p^4) terms including the homogeneous Wess-Zumino (hWZ) terms. Using the master formulas for the low energy constants derived from holographic QCD models the skyrmion matter properties can be quantitatively calculated with the input values of the pion decay constant fπf_\pi and the vector meson mass mρm_\rho^{}. We find that the hWZ terms are responsible for the repulsive interactions of the ω\omega meson. In addition, the self-consistently included O(p4)\mathcal{O}(p^4) terms with the hWZ terms is found to increase the half skyrmion phase transition point above the normal nucleon density.Comment: Contribution to SCGT12 "KMI-GCOE Workshop on Strong Coupling Gauge Theories in the LHC Perspective", 4-7 Dec. 2012, Nagoya Universit
    corecore